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DISK CONTROLLER 



BACKGROUND OF THE INVENTION 

A) FIELD OF THE INVENTION 

The present invention relates to a disk 
controller for controlling a plurality of disk drives, 
and more particularly to a high reliability disk 
controller using connection-less type multiplex 
communication . 

B) DESCRIPTION OF THE RELATED ART 

U.S. Patent No. 6,601,134 and No. 2003046460 
disclose a storage system. 

A disk sub-system (hereinafter simply called 
a "sub-system") using magnetic disk drives as storage 
media has an input/output performance lower by three to 
four digits than that of a main storage of a computer 
using semiconductor storages as storage media. A lot 
of effort has been put into reducing this difference, 
i.e., improving the input/output performance of the 
sub-system. One method of improving the input/output 
performance of the sub-system is to use a disk 
controller which controls a plurality of magnetic disk 
drives into which data is distributively stored. 

For example, a conventionally known disk 
controller such as shown in Fig. 16 has a plurality of 
channel adapters 2100 which execute data transfer 
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between a host computer and a disk drive; a plurality 
of cache memory adapters 2300 for temporarily storing 
data to be transferred between the host computer and 
disk drive; a plurality of control memory adapters 2301 
5 for storing control information on the operation of the 
disk controller; and a plurality of switch adapters 
2400 for establishing connections between the cache 
memory adapters and channel adapters. The channel 
adapters 2100 and cache memory adapters 2300 are 

10 interconnected by a data system inner network via the 
switch adapters 2400. The channel adapters 2100 and 
control memory adapters 2301 are interconnected by a 
control system inner network. With these network 
connections, all the channel adapters 2100 can access 

15 the cache memory adapters 2300 and control memory 
adapters 2301 . 

Each channel adapter 2100 has: data link 
engines (DLEs) 2110 for executing packet transfer in 
the data system internal network; DMA controllers 

20 (DMACs) 2120 for executing DMA transfer in the data 
system inner network; a selector 2115 for 
interconnecting DLEs 2110 and DMACs 2120; protocol 
engines (PE) 2130 for controlling communication between 
the host computer and disk drive; ports 2140 for 

25 connection to the host computer or disk drive; DLEs 
2210 for executing packet transfer in the control 
system inner network; DMACs 2220 for DMA transfer in 
the control system inner network; micro-processors 



(MPs) 2230 for controlling the operation of the disk 
controller; and a selector 2125 for interconnecting 
DMACs 2120 and PEs 2130 or MPs 2230. 

The cache memory adapter 2300 and control 
memory adapter 2301 each have: DLEs 2310 for executing 
DMA transfer in the data system internal network or 
control system internal network; DMACs 2320 for 
executing DMA transfer in each inner network; memory 
controllers (MCs) 2330; memory modules (MMs) 2340; a 
selector 2315 for interconnecting DLEs 2310 and DMACs 
2320; and a selector 2325 for interconnecting DMACs 
2320 and MCs 2330. 

The switch adapter 2400 has: DLEs 2410 for 
executing packet transfer in the data system inner 
network; DMACs 2420 for executing DMA transfer in the 
data system inner network; and a selector 2430 for 
interconnecting DMACs 2420. 

Data transfer between the adapters is 
realized by cooperative operations of DMACs in the 
respective adapters. As an example of this, with 
reference to Figs. 18 and 19, description will be made 
on an outline operation of DMA transfer of data from 
the host computer to the cache memory adapter 2300 in 
the disk controller. 

. When a WRITE request is issued from the host 
computer via the connection port 2140, MP 2230 
calculates an area of the cache memory adapter for 
temporarily storing WRITE data, and notifies the 



calculated result to DMAC 2120 in the channel adapter 
as a DMA list 2600. DMAC 2120 issues requests 2605 for 
acquiring paths to the cache memory adapters necessary 
for DMA transfer. Since the WRITE data is stored in a 
plurality of cache memory adapters (two cache memory 
adapters having DMAC 2321 and DMAC 2322) in order to 
improve the reliability, a plurality of path 
establishing requests are issued. After necessary 
paths are established, DMAC 2120 transfers the WRITE 
data to DMAC 2420 at the relay point switch, in 
accordance with the contents of the DMA list 2600. In 
this case, the WRITE data is transferred from the host 
computer by dividing it into a data amount having a 
predetermined size . 

DMA 2420 of the switch adapter 2400 generates 
DMA sub-requests 2611 and 2612 for DMACs 2321 and 2322 
of the cache memory adapters, in accordance with the 
transfer requests sent from DMAC 2120 of the channel 
adapter 2100. In response to the requests 2611 and 
2612, DMACs 2321 and 2322 return sub-statuses 2621 and 
2622 which are the request completion notices. After 
DMAC 2120 of the channel adapter confirms the sub- 
statuses 2621 and 2622, it issues the next DMA sub- 
request. When the sub-statuses of all the DMA sub- 
requests are returned, DMAC 2120 issues release 
requests 2625 for the established paths to the cache 
memory adapters, and returns a completion status 2630 
to MP 2230 to thereby complete the process for the DMA 
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list 2600. During the DMA transfer, MP 2230 accesses 
the control memory adapter 2301 when necessary. In 
this case, similar DMA transfer is performed between 
DMAC 2220 of the channel adapter 2100 and DMAC 2320 of 
5 the control memory adapter 2301. 

Fig. 17 shows the structure of a packet used 
by DMA transfer. A command packet 2520 has: an address 
field 2521 for indicating a targeting DMAC; an address 
field 2522 for indicating an initiating DMAC; memory 

10 address fields 2523 and 2524 for indicating memory 
addresses at which transfer data is stored; and an 
error check code 2525. 

The path establishing request 2605 is issued 
by using the command packet 2520. A data packet 2530 

15 has: an address field 2531 for indicating a targeting 
DMAC; an address field 2532 for indicating an 
initiating DMAC; transfer data 2533; and an error check 
code 2535. The DMA sub-request is issued by using the 
data packet 2530. 

20 Fig. 20 illustrates a transfer protocol for 

the path request command 2605 and DMA sub-request 2610. 
In order to facilitate a failure recovery process, 
processes are all executed by non-multiplex 
communication. Namely, after it is confirmed that the 

25 sub-status 2620 for the DMA sub-request 2610 is 

returned, the next DMA sub-request 2610 is issued. 
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SUMMARY OF THE INVENTION 

As described above, DMA transfer in a 
conventional disk controller described in the above- 
cited Patent documents is performed by connection type 
5 non-multiplex communication because of easy 

implementation. Namely, DMAC establishes the paths 
necessary for the execution of DMA transfer, and during 
DMA transfer the paths are occupied (connection type 
communication) . Moreover, until the sub-status for the 

10 DMA sub-transfer immediately before is confirmed, the 
next DMA sub-request cannot be executed (non-multiplex 
communication) . 

A conventional disk controller has therefore 
a low use efficiency of the inner network paths, which 

15 may hinder the performance improvement. In order to 
satisfy the conditions that the necessary path 
bandwidth is reserved at the limited path use 
efficiency, a complicated inner network configuration 
is required such as implementation of both the data 

20 system inner network and control system inner network, 
resulting in a high cost. 

An object of the present invention is to 
provide a disk controller using connection-less type 
multiplex communication, capable of addressing issues 

25 of the prior art, realizing a high transfer efficiency 
(performance) while retaining a high reliability 
equivalent to that of a conventional disk controller, 
and realizing a low cost. 



In order to solve the above-described issues, 
the present invention adopts the following 
configuration . 

A disk controller includes: a channel adapter 
having a connection interface to a host computer or a 
disk drive; a memory adapter for temporarily storing 
data to be transferred between the host computer and 
disk drive; a processor adapter for controlling 
operations of the channel adapter and memory adapter; 
and a switch adapter for configuring an inner network 
by interconnecting the channel adapter, memory adapter 
and processor adapter, wherein: the channel adapter, 
memory adapter, processor adapter and switch adapter 
each include a DMA controller for performing a 
communication protocol control of the inner network; 
and packet multiplex communication is performed among 
the DMA controllers provided in the adapters. 

According to the invention, by adopting 
connection-less type multiplex communication, multiplex 
becomes possible not only during one DMA sub-transfer 
(as will be later described, transfer state of the sub- 
DMA and sub-status shown in Fig. 11) but also during a 
plurality of sub-DMA transfers (alternative transfer 
state of sub-DMA 615 and sub-DMA 616 shown in Fig. 11) . 
The path use efficiency can be improved considerably 
and it is not necessary to separately provide a control 
system inner network and a data system inner network as 
in the case of a conventional disk controller. 
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Accordingly, the cache memory adapter and control 
memory adapter are integrated to a memory adapter. 
Since the path use efficiency is improved, the path use 
limitation is relaxed so that the processor in the 
5 channel adapter can be used in the processor adapter 
which is independent from the channel adapter. A disk 
controller can be realized which has a high performance 
and a low cost and is excellent in scalability. 

BRIEF DESCRIPTION OF THE DRAWINGS 
10 Fig. 1 is a diagram showing the overall 

structure of a disk controller according to an 

embodiment of the invention. 

Fig. 2 is a diagram showing an example of the 

specific structure of a data link engine used by each 
15 adapter of the disk controller according to the 

embodiment . 

Fig. 3 is a diagram showing an example of the 
specific structure of a DMA controller used by each 
adapter of the disk controller according to the 
20 embodiment. 

Fig. 4 is a diagram showing the structure of 
a channel adapter of the disk controller according to 
the embodiment . 

Fig. 5 is a diagram showing the structure of 
25 a processor adapter of the disk controller according to 
the embodiment. 

Fig. 6 is a diagram showing the structure of 
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a memory adapter of the disk controller according to 
the embodiment. 

Fig. 7 is a diagram showing the structure of 
a switch adapter of the disk controller according to 
5 the embodiment. 

Fig. 8 is a diagram showing the structure of 
a packet used by the disk controller according to the 
embodiment . 

Fig. 9 is a diagram illustrating a packet 
10 flow used by the disk controller according to the 
embodiment . 

Fig. 10 is a diagram illustrating a protocol 
used by the disk controller according to the embodiment 
Fig. 11 is a diagram illustrating a multiplex 
15 communication transfer protocol used by the disk 
controller according to the embodiment. 

Fig. 12 is a diagram illustrating a DMA 
sequence field update flow during DMA sub-transmission 
used by the disk controller according to the embodiment 
20 Fig. 13 is a diagram illustrating a DMA 

sequence field confirmation flow during sub-status 
reception used by the disk controller according to the 
embodiment . 

Fig. 14 is a diagram showing the overall 
25 structure of a disk controller according to another 
embodiment of the invention. 

Fig. 15 is a diagram showing the overall 
structure of a disk controller according to still 
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another embodiment of the invention. 

Fig. 16 is a diagram showing the overall 
structure of a conventional disk controller. 

Fig. 17 is a diagram showing the structure of 
5 a packet used by the conventional disk controller. 

Fig. 18 is a diagram illustrating a packet 
flow used by the conventional disk controller. 

Fig. 19 is a diagram illustrating a protocol 
used by the conventional disk controller. 
10 Fig. 20 is a diagram illustrating a non- 

multiplex communication protocol used by the 
conventional disk controller. 

DESCRIPTION OF THE EMBODIMENTS 

Embodiments of a disk controller of this 

15 invention will be described in detail with reference to 
Figs. 1 to 15. 

Fig. 1 is a diagram showing the overall 
structure of a disk controller according to an 
embodiment of the invention. The disk controller of 

20 this embodiment has: a channel adapter 100 having an 
interface 140 for connection to a host computer or a 
disk drive; a memory adapter 300 for temporarily 
storing data to be transferred between the host 
computer and disk drive; a processor adapter 200 for 

25 controlling the operations of the channel adapter 100 
and a memory adapter 300; and a switch adapter 400 
constituting an inner network by interconnecting the 



channel adapter 100, memory adapter 300 and processor 
adapter 200. 

The channel adapter 100, processor adapter 
200, memory adapter 300 and switch adapter 400 have DMA 
controllers (DMACs) 120, 220, 320 and 420, respectively, 
the DMA controllers performing a communication protocol 
control of the inner network. Switch adapters can be 
connected each other by their expansion ports 440. 
DMACs execute DMA transfer with involvement data link 
engines (DLEs) 110, 210, 310 and 410, respectively. 
Connection-less type packet multiplex communication 
shown in Fig. 11 is performed among these DMA 
controllers . 

Fig. 11 is a diagram illustrating a multiplex 
communication transfer protocol used by the disk 
controller according to the embodiment of the invention. 
As shown in Fig. 11, without confirming a sub-status 
for a DMA sub-request, the next DMA sub-request is 
issued (multiplex communication, i.e., multiplex 
communication during one DMA sub-transfer) . In 
addition, DMA transfer between DMA1 and DMA 2 and DMA 
transfer between DMA 3 and DMA 4 share the same path 
between DLE1 and DLE2 (connection-less type 
communication) . In the example shown in Fig. 11, a 
sub-DMA 615 and a sub-DMA 616 are alternately 
transferred by sharing the same path between DELI and 
DLE2 to perform multiplex communication. As will be 
understood from the description of Fig. 8 to be later 
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given, the connection-less type multiplex communication 
becomes possible by adopting the packet structure that 
contains information (TASK ID) for the sequence control 
of a destination, data and a DMA sub-request. 

In the example shown in Figs. 1 and 11, 
adopting the connection-less type multiplex 
communication allows multiplex not only during one DMA 
sub-transfer but also during a plurality of DMA sub- 
transfers. Therefore, the path use efficiency can be 
improved greatly (because data transfer can be 
performed without a time interval between paths) . It 
is quite unnecessary to separately implement the 
control system inner network and data system internal 
network as made conventionally. It is therefore 
possible to use the memory adapter integrating the 
cache memory adapter and control memory adapter, and 
moreover to use the processor adapter independent from 
the channel adapter because the path use limit is 
relaxed. A disk controller of a low cost and excellent 
in scalability and flexibility can therefore be 
achieved . 

Fig. 5 is a diagram showing an example of the 
specific structure of a processor adapter of the disk 
controller according to the embodiment of the invention, 
and Fig. 2 is a diagram showing the specific structure 
of a data link engine used by the processor adapter. 
The structure of the data link engine (DLE) shown in 
Fig. 2 can be applied not only to the processor adapter 



but also to other adapters. 

The processor adapter 200 shown in Fig. 5 
has: micro-processors (MPs) 230; a plurality of DMA 
controllers 220 and one or more data link engines 
(DLEs) 210. A selector 225 interconnects MPs 230 and- 
DMACs 220, and a plurality of DMA controllers 220 share 
DLEs 210 via the selector 215. Namely, the number of 
DMACs is usually much larger than the number of DLEs . 

Since a DMA arbiter 2150 of the selector 215 
arbitrates requests from a plurality of DMA controllers 
220, DMA transfer from a plurality of DMACs via the 
same DLE 210 can be executed at the same time 
(connection-less communication) . Reception data from 
DLE 210 is distributed by a DLE arbiter 2155 to a 
target DMAC 22 0. 

As shown in Fig. 2, DLE has a transmission 
port 1101, a transmission buffer 1102, a reception port 
1105, a reception buffer 1106, a retry logic 1110 and a 
retry buffer 1120. The retry buffer and retry logic 
perform a process of realizing error free transfer at 
the data link. Namely, a packet sent from the 
transmission buffer to the transmission port is stored 
in the retry buffer 1120 by the retry logic 1110. A 
status representative of whether the packet reached 
correctly is returned to the reception port, and if an 
error is reported, the packet is again sent from the 
retry buffer by the retry logic. The DLE structure 
shown in Fig. 2 allows a data link error control in the 



packet unit and realizes multiplex communication. 

With the example of the structure shown in 
Figs. "5 and 2, connection-less type multiplex 
communication becomes possible and a disk controller 
can be realized which has a high performance and is 
flexible and simple and of a low cost- 
Fig. 4 is a diagram showing an example of the 
specific structure of the channel adapter of the disk 
controller according to the embodiment of the invention, 
and Fig. 3 is a diagram showing an example of the 
specific structure of the DMA controller used by the 
channel adapter. The structure of the DMA controller 
(DMAC) shown in Fig. 3 is applicable not only to the 
channel adapter but also to other adapters. 

The channel adapter shown in Fig. 4 has 
protocol engines 130, DMACs 120 and DLEs 110. PE 130 
and DMAC 120 are connected by a selector 125, and DMAC 
120 and DLE 110 are connected by a selector 115. Each 
DMA controller 120 has a plurality of reception FIFO 
buffers VC0 and VC1 and a plurality of transmission 
FIFO buffers VC0 and VC1. 

The DMA controller 120 shown in Fig. 3 is 
constituted of a multiplexer 1201, transmission FIFO 
buffers 1202, a demultiplexer 1205, reception FIFO 
buffers 1206, a transaction logic 1210, a sequence 
management table 1220, a packet assembly logic 1230 and 
a packet disassembly logic 1240. An arbiter 1212 
arbitrates contention of transmission data among a 



plurality of transmission FIFO buffers 1202 and the 
multiplexer 1201 selects the transmission data. 

Similarly, the demulitplexer 1205 selects 
reception data under the control by the arbiter 1212 
and stores it in a proper FIFO buffer among a plurality 
of reception FIFO buffers 1206. The packet assembly 
logic 1230 and packet disassembly logic 1240 are logic 
circuits for assembling and disassembling the packet. 
The sequence control logic 1213 and sequence management 
table 1220 manage the DMA sequence of DMA sub-transfers, 
the description of this operation being later given. 

With the example shown in Figs. 4 and 3, a 
plurality of buffers VC0 and VC1 can be used for each 
DLE. For example, one DLE can use a mixture of the 
control system inner network and data system inner 
network (for example, VC0 is used for the data system 
inner network, and VC1 is used for the control system 
network) . The arbiter 1212 can operate to set a 
priority order to a plurality of buffers. For example, 
if the control system inner network is set to have a 
priority over the data system inner network, it is 
possible to avoid a longer access delay time of the 
control system inner network otherwise caused by a 
mixture of both the networks. Namely, with this 
arrangement, it is possible to realize a disk 
controller of a simpler inner network configuration and 
both the performance improvement and low cost. 

Fig. 6 is a diagram showing an example of the 



specific structure- of the memory adapter of the disk 
controller according to the embodiment of the invention. 
The memory adapter shown in Fig. 6 has memory modules 
(MMs) 340, memory controllers (MCs) 330, DMACs 320 and 
DLEs 310. MC 330 and DMAC 320 are interconnected by a 
selector 325, and DMAC 320 and DLE 310 are 
interconnected by a selector 315. Each DMA controller 
(DMAC) 320 has a reception buffer (VC0 or VC1) and a 
transmission buffer (VC0 or VC1) . Contention of 
transmission data is arbitrated among a plurality of 
transmission FIFO buffers VC0 and among a plurality of 
transmission FIFO buffers VC1 to transfer data to DLE 
310. Similarly, contention of reception data is 
arbitrated, among a plurality of reception FIFO buffers 
VC0 and among a plurality of reception FIFO buffers VC1 
to store data in a proper reception FIFO. 

Arbiters 3250 and 3255 arbitrate the 
contention conditions between DMAC 320 and MC 330. One 
MC can therefore be shared by a plurality of DMACs, and 
the priority order control among DMACs can be realized 
as the function of the arbiters. For example, if DMACs 
for the control system inner network and DMACs for the 
data system inner network are provided and the DMACs 
for the control system inner network are set to have a 
priority over the data system inner network, then 
accesses to the control system inner network can be 
suppressed from being influenced by interference of the 
operation of the data system inner network. 
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With the structure shown in Fig. 6, a 
plurality of DMACs can be used in correspondence with 
one DLE. For example, one DLE has a mixture of the 
control system inner network and data system inner 
5 network. A plurality of DMACs can be used in 

correspondence with one MC allowing a mixture of the 
control system memory and data system memory. With 
this structure therefore, it becomes possible to 
realize a disk controller of a simpler inner network 

10 structure, satisfying both the performance improvement 
and low cost . 

Fig. 8 is a diagram showing an example of the 
specific structure of the packet to be transferred 
among a plurality of DMA controllers in the disk 

15 controller according to the embodiment of the invention. 
The packet 500 shown in Fig. 8 has at least an address 
field 511 for indicating a targeting DMA controller, an 
address field 521 for indicating an initiating DMA 
controller and a DMA sequence field 524 for managing 

20 the transfer sequence when one DMA transfer is divided 
into a plurality of packets. 

In the disk controller according to the 
embodiment of the invention, since DMA transfer is 
performed by connection-less type multiplex communi- 

25 cation, it is necessary to guarantee the transfer 

sequence of DMA and properly perform an error check 
process and a failure recovery process. As a means for 
this, the DMA sequential field is provided so as to 



reliably identify the packet, and this field is 
controlled (preferably sequentially incremented) so as 
to make it unique (distinguishable) in one DMA transfer. 
With the example of the packet structure 
5 shown in Fig. 8, a proper sequence guarantee and its 
check are possible in the DMA transfer by connection- 
less type multiplex communication, and a proper failure 
recovery process can be performed when a failure occurs. 
With this structure, it becomes possible to realize a 

10 disk controller having a high reliability equivalent to 
the reliability of a conventional disk controller. 

The packet 500 shown in Fig. 8 has a first 
address 511 for designating a packet relay DMA 
controller, second and third addresses 522 and 523 for 

15 designating targeting DMA controllers and transfer data 
531 to be transferred to the targeting DMA controllers. 
When a WRITE request is issued from the channel adapter 
100 to the memory adapter 300, the first address 
designates DMAC 420 of the switch adapter and the 

20 second and third addresses designate DMACs 320 of the 

memory adapter. A plurality of addresses of the memory 
adapters are designated in order to improve the 
reliability perform duplicate WRITE for the cache 
memories . 

25 With this packet structure, the DMA transfer 

function including duplicate WRITE can be applied to 
connection-less multiplex communication so that the 
disk controller of a high reliability can be realized. 
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The packet 500 shown in Fig. 8 also has a 
routing header 510 containing control information for 
DLE, a command header 520 containing control 
information for the DMA controller, and a data block 
5 530 containing other data. The routing header 510 has 
a routing header error check code 515 for checking any 
transfer error in the routing header. The command 
header 520 has a command header error check code 525 
for checking any transfer error in the command header. 

10 The data block 530 has a data block error check code 
535 for checking any transfer error in the data block. 

With this packet structure, the routing 
control information, DMAC control information and data 
information can be protected by different error check 

15 codes, resulting in a finer DMA transfer control and a 
finer failure recovery process. Even if the routing 
control information is required to be rewritten such as 
when duplicate WRITE is performed via the switching 
adapter, it is possible to minimize the recalculation 

20 range of the error check code and realize the disk 

controller of a high reliability and a high performance. 

Fig. 9 is a diagram showing the flow of a 
packet used by the disk controller according to the 
embodiment of the invention, and Fig. 10 is a diagram 

25 illustrating a protocol used by the disk controller 

according to the embodiment of the invention. In the 
example shown in Figs. 9 and 10, a DMA sub-request 610 
is issued from DMAC 120 of the channel adapter to DMAC 
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420 of the switch adapter. In the packet of the DMA 
sub-request 610, the initiating address field 521 
designates the channel adapter DMAC 120 as the master 
DMA and the targeting address field 511 designates the 
5 switch adapter DMAC 420. 

The DMA controller 420 sends back completion 
sub-statuses 621 and 622 corresponding to the DMA 
transfer sub-request 610 to the DMA controller 120. 
The completion sub-statuses 621 and 622 contain the 
10 information of the DMA sequence field 524 contained in 
the DMA transfer sub-request 610. The DMA controller 
120 confirms the information in this DMA sequence field 
in order to confirm the transfer sequence of DMA sub- 
transfers . 

15 Fig. 12 is a diagram illustrating a DMA 

sequence field update flow during DMA sub-transmission 
used by the disk controller according to the embodiment 
of the invention, and Fig. 13 is a diagram illustrating 
a DMA sequence field confirmation flow during sub- 

20 status reception used by the disk controller according 
to the embodiment of the invention. Each DMAC holds 
the value of a current DMA sequence field in a variable 
CURR_DMA_SEQ. During the DMA sub-transmission, while 
CURR_DMA_SEQ is incremented, it is inserted into the 

25 DMA sequence field 524 of each transfer packet. Each 
DMAC holds the value of the DMA sub-status to be 
returned next, in a variable NEXT__DMA_SEQ . When the 
DMA sub-status is returned, the value of the DMA 
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sequence is compared with an expected value. If both 
are coincide with each other, the coincident 
NEXT_DMA_SEQ is incremented. If both are not coincide, 
the DMA transfer sub-requests under execution (from 
5 NEXT_DMA_SEQ to CURR_DMA_SEQ) are cancelled and 
thereafter a failure is notified to the processor. 

In the example of the structure shown in Figs 
9 and 10 and Figs. 12 and 13, also for the DMA sub- 
transfer, the transfer sequence of each DMA can be 

10 reliably controlled by using the DMA sequence field 524 
Namely, with this structure, a disk controller of a 
high reliability can be realized using connection-less 
multiplex communication . 

Figs. 9 and 10 also illustrate a packet flow 

15 (protocol) of duplicate WRITE used by the disk 

controller according to the embodiment of the invention 
In this example of the structure, DMA sub-requests 611 
and 612 are issued from the channel adapter DMAC 120 to 
the memory adapter DMACs 321 and 322 via the switch 

20 adapter DMAC 420. In the packet of the DMA sub-request 
610, the initiating address field 521 designates the 
channel adapter DMAC 120, the targeting address field 
511 designates the switch adapter DMAC 420, the 
targeting field 511 designates the memory adapter DMACs 

25 321 and 322, and the data block (field) 531 stores the 
transfer data. 

The DMA controller 420 of the switch adapter 
generates a DMA sub-request packet 611 and a DMA sub- 
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request packet 612 and transfers the packets to the 
respective targeting addresses. The former packet 611 
has DMAC 321 as the targeting address field and 
contains the transfer data 531, and the latter packet 
5 612 has DMAC 322 as the targeting address field and 

contains the transfer data 531. In response to the DMA 
sub-requests 611 and 612, the DMACs 321 and 322 of the 
memory adapter return sub-statuses 621 and 622 to the 
channel adapter DMAC 120 via the switch adapter DMAC 
10 420. 

The example of the structure shown in Figs. 9 
and 10 can realize cache memory duplicate WRITE by the 
switch adapter DMAC. Since DMAC 420 of the switch 
adapter 400 near the memory adapter 300 generates the 

15 packets for duplicate WRITE, the bandwidth of the inner 
network will not be consumed wastefully and the path 
efficiency can be improved. With the example of this 
structure, a disk controller of a high performance and 
a high reliability can be realized. 

20 Fig. 7 is a diagram showing an example of the 

specific structure of the switch adapter of the disk 
controller according to the embodiment of the invention. 
The switch adapter shown in Fig. 7 has a plurality of 
DLEs 410, a plurality of DMACs 420 and a selector 430. 

25 A packet received from a reception side DLE 410 is 

stored distributively in a plurality of reception FIFO 
buffers (VC0, VC1) in a reception side DMAC 420, and 
thereafter, sent to transmission FIFO buffers in a 
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transmission DMAC 420 via selector logic circuits 4301, 
4302, 4306 and 4307 prepared for the respective 
transmission FIFO buffers, and transmitted from a 
transmission side DLE 410. 
5 With the example of the structure shown in 

Fig. 7, similar to the packet having the routing 
control information, DMAC control information and data 
information shown in Fig. 8, a packet to be transferred 
among a plurality of DMA controllers has a header 

10 including targeting DMAC information and a data field 
including other data. The header includes a header 
error check code for checking any transfer error in the 
header. The data field includes a data field error 
check code for checking any transfer error in the data 

15 field. 

Until the header error check code is 
confirmed, the reception side DMA controller 420 in the 
switch adapter will not send the packet to the 
transmission side DMAC. After the header error check 

20 code is confirmed, the header and data field of the 
packet are sent to the transmission side DMAC in a 
pipeline processing manner. If an error is found by 
the header error check code, the packet is discarded 
and a proper error recovery process is executed. 

25 With the example of the structure shown in 

Fig. 7, the switch adapter can start a transmission 
process from the transmission DLE before the whole data 
field is fetched from the reception DLE and the data 



field error check code is confirmed, and the packet 
having an illegal targeting address field because of an 
error in the header is discarded to prevent the 
propagation of the error. With the example of the 
5 structure, a disk controller of a high performance and 
a high reliability can be realized. 

The adapter used by the disk controller 
according to the embodiment of the invention, such as 
the channel adapter shown in Fig. 4 and the processor 

10 adapter shown in Fig. 5, has the structure that a 

plurality of DMACs share a plurality of DLEs . In the 
case of the channel adapter shown in Fig. 4, two DLEs 
and sixteen DMACs are provided and there may be the 
case wherein each DMAC shares a few DLEs. With this 

15 redundancy structure, for example, during DMA 

communication by DMAC via some DLE, even if a failure 
occurs at this DLE, the DMAC arbiter 1150 (refer to Fig. 
4) or 2150 (refer to Fig. 5) performs a routing control 
to connect another DLE. Similarly, the DMAC arbiter 

20 1150 or 2150 performs a routing control for a plurality 
of DMAC processes to distribute the processes to a 
plurality of DLEs and realize load distribution. 

With the example of the structure, the 
arbiter 1150 or 2150 controls to make the same DLE deal 

25 with transmission/reception for a series of DMA sub- 
requests and sub-statuses from the same DMAC. More 
preferably, a transmission/reception in the normal 
operation is fixed for the requests and statuses from 



the same DM AC. 

With the example of the structure shown in 
Figs. 4 and 5, the inner network route is fixed for a 
series of DMA sub-requests and sub-statuses. Therefore, 
5 there is no possibility of a sequence exchange (outrun) 
due to different routes. The sequence control of DMA 
sub-requests and sub-statuses can be facilitated 
greatly. Namely, with the example of the structure, a 
disk controller of a high reliability can be realized 
10 easily. 

Fig. 14 is a diagram showing the overall 
structure of a disk controller according to another 
embodiment of the invention. In this embodiment of the 
invention shown in Fig. 14, a plurality of channel 

15 adapters 100, a plurality of processor adapters 200 and 
a plurality of memory adapters 300 are interconnected 
by a plurality of switch adapters 400. By providing a 
plurality of paths among all the adapters, it becomes 
possible to realize redundancy capable of recovering an 

20 arbitrary one-point failure. The connection of each 
adapter is as shown in Fig. 14. Each adapter has the 
paths for corresponding two adapters. 

According to this embodiment of the invention, 
the reliability can be improved by enhancing the 

25 redundancy of the disc controller system. 

Fig. 15 is a diagram showing the overall 
structure of a disk controller according to still 
another embodiment of the invention. In this 
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embodiment of the invention shown in Fig. 16, two disk 
controllers of the embodiment shown in Fig. 14 are used 
by coupling expansion ports of the switch adapters. 
With this connection, additional channel adapters, 
5 processor adapters and memory adapters can be installed 
so that the system scalability can be improved by using 
the same architecture. With this embodiment of the 
invention, the scalability of the disk controller can 
be improved. 

10 As described so far, adopting the disk 

controller of the embodiments of the invention shown in 
Figs. 1 to 15 can provide the following functions and 
effects. According to the embodiments, a plurality of 
buffers can be set in one-to-one corrrespondence with 

15 one DLE. For example, the control system inner network 
and data system inner network can be mixed in one DLE. 
The arbiter can set the priority order of a plurality 
of buffers. For example, if the control system inner 
network is set to have a priority over the data system 

20 inner network, it is possible to avoid a longer access 
delay time of the control system inner network 
otherwise caused by a mixture of both the networks. 
With this arrangement, it is possible to realize a disk 
controller of a simpler inner network configuration and 

25 both the performance improvement and low cost. 

According to the embodiments, a plurality of 
DMACs can be set in one-to-one correspondence with one 
DLE. For example, the control system inner network and 



data system inner network can be mixed in one DLE. A 
plurality of DMACs can be set in one-to-one 
correspondence with one MC, so that the control system 
inner network and data system inner network can be 
5 mixed. A disk controller of a simpler inner network 
structure can be realized, satisfying both the 
performance improvement and low cost. 

According to the embodiments, a proper 
sequence guarantee and its check are possible in the 

10 DMA transfer by connection-less type multiplex 

communication, and a proper failure recovery process 
can be performed when a failure .occurs. With this 
structure, it becomes possible to realize a disk 
controller having a high reliability equivalent to the 

15 reliability of a conventional disk controller. 

According to the embodiments, the routing control 
information, DMAC control information and data 
information can be protected by different error check 
codes, resulting in a finer DMA transfer control and a 

20 finer failure recovery process. Even if the routing 

control information is required to be rewritten such as 
when duplicate WRITE is performed via the switching 
adapter, it is possible to minimize the recalculation 
range of the error check code and realize the disk 

25 controller of a high reliability and a high performance. 

According to the embodiments, it becomes 
possible to realize cache memory duplicate WRITE by the 
switch adapter DMAC. Since DMAC of the switch adapter 
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near the memory adapter generates the packets for 
duplicate WRITE, the bandwidth of the inner network 
will not be consumed wastefully and the path efficiency 
can be improved. 
5 According to the embodiments, the switch 

adapter can start a transmission process from the 
transmission DLE before the whole data field is fetched 
from the reception DLE and the data field error check 
code is confirmed, and the packet having an illegal 

10 targeting address field because of an error in the 

header is discarded to prevent the propagation of the 
error- According to the embodiments, since the inner 
network route is fixed for a series of DMA sub-requests 
and sub-statuses, there is no possibility of a sequence 

15 exchange (outrun) due to different routes. The 

sequence control of DMA sub-requests and sub-statuses 
can be facilitated greatly. 

According to the embodiments, the reliability 
can be improved by providing the redundance with the 

20 disk controller system. According to the embodiments, 
the scalability of the disk controller can be improved. 

This application relates to and claims 
priority under 35 U.S.C. 119 from Japanese Patent 
Application No. 2004-038459 filed on February 16, 2004 

25 which is cited to support the present invention. 



